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Abstract 

We completely characterise the complexity in the decision tree model of computing composite 
relations of the form h = g o (/i, . . . , /"), where each relation p is boolean- valued. Immediate 
corollaries include a direct sum theorem for decision tree complexity and a tight characterisation 
of the decision tree complexity of iterated boolean functions. 

1 Introduction 

The decision tree model captures the complexity of computing functions / : X™ — )■ y in a setting 
where the quantity of interest is the number of queries to the input (see [1] for a good review of 
the model). We are allowed to query individual elements Xi £ X at unit cost, and seek to compute 
/(xi, . . . , Xm) using the minimal number of queries. 

Here we will be interested in the decision tree complexity of composite functions, i.e. functions 
of the form 

h{x)=g{fHx'),...,nxn), 

where E X™'\ One strategy to compute h is first to replace g with the function g given by 
substituting the values taken by any constant functions /* into g, and then to compute g using 
efficient algorithms for . . . , as black boxes. In other words, we use an efficient algorithm for 
computing ^(yi, . . . , and replace each query to a variable i/i with the evaluation of using 
an optimal algorithm. As the inputs are independent, it is natural to conjecture that this is in fact 
always the most efficient way to compute g. However, a moment's thought shows that this cannot 
be the case in general: indeed, consider the function h{x) = g{f{x)), where / : {0, l}" — )• {0, !}"■ 
is the identity function /(x) = x, and g : {0, 1}" — >• {0, 1} is the function extracting the first bit 
of the input, g{y) = yi. Then / requires n queries to be computed, whereas h can be computed 
with 1 query. Another simple, but arguably less trivial, example of this phenomenon is illustrated 
in Figure 1 below; the key point in this example is that the worst-case input to the functions /* 
may not produce an output which is a worst-case input to g. 

We will show that, nevertheless, the simple algorithm above is optimal if the functions /* are 
boolean-valued, i.e. when Y = {0, 1}. In fact, we prove such a result for the generalisation of the 
decision tree model to computing relations. A decision tree computes a relation / C x Y if, 
on input x G X™, it outputs y such that (x,y) G / (see Section 1.2 for a formal definition of the 
model). We call this result a composition theorem for decision tree complexity. 
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(a) Optimal tree for / (b) Optimal tree for g (c) Optimal tree for h 



Figure 1: Let / : {0, 1}^ {0, 1, 2} and g : {0, 1, 2}^ ^ {0, 1, 2} be defined by the decision trees 
above (where edges correspond to elements of {0, 1} or {0, 1, 2} in ascending order from left to 
right). Set /i(xi, X2, X3, X4) = (7(/(xi, X2), /(xs, X4)). Then / and g require 2 queries each to be 
computed, but h can be computed using only 3. 



1.1 Related work 

This work can be seen as fitting into the framework of "direct sum" results in query complexity, 
in which there has been significant recent interest (see the thesis [2] for a good review). The basic 
aim of such work is to show that, given n independent computational problems, the most efficient 
way to solve them all is simply to use an optimal algorithm for each problem separately. While 
this claim is intuitively obvious, it is not necessarily easy to prove, and in some cases is even false; 
see [2] for some examples. 

In particular, Jain, Klauck and Santha have shown [5] such a direct sum theorem for determin- 
istic and randomised decision tree complexity. As discussed in [5], previous work had dismissed 
this question as uninteresting, but the proof is not immediate. In the deterministic case, the result 
of [5] is proven by showing that, given a decision tree T computing n independent outputs, one 
can produce n independent trees Tj, one for each output, and lower bounding the depth of T in 
terms of the depth of the trees Tj. This approach does not seem suitable for proving a composition 
theorem, as given a tree computing h, it is not necessarily possible to produce individual trees for 

. . . ,/". On the other hand, the composition theorem given here in fact implies a direct sum 
theorem for decision tree complexity (see Section 2.1 for this and other corollaries). 

Similar, but more complicated, composition theorems to the present work have been proven for 
quantities which are important in the study of quantum query complexity. H0yer, Lee and Spalek [3] 
proved that the so-called non-negative weight adversary bound (which we will not define here; 
see [3] for details) obeys a composition theorem. Interestingly, their proof is based on generalising 
the query model to weighted queries, similarly to the approach taken here. A composition theorem 
was later proven for the general adversary bound (in two parts; a lower bound by H0yer, Lee and 
Spalek [4], and an upper bound by Reichardt [7]), which was then used by Reichardt to infer a 
composition theorem for quantum query complexity [8]. 

1.2 Relations and decision trees 

We use essentially the standard model of decision trees (see e.g. [1] for definitions, and [5] for the 
extension to relations). Fix finite sets X and Y, and integer m. A decision tree T is a rooted 
ary tree whose internal vertices are labelled with indices between 1 and n, whose edges are labelled 
with elements of X, and whose leaves are labelled with elements of y. T computes a function 
/t '■ X^ — )• y as follows: starting with the root, it queries the input variable indexed by the label 
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of the current vertex and, depending on the outcome, evaluates the corresponding subtree. When 
a leaf is reached, the tree outputs the label of that leaf. The depth of T is the maximal length of 
a path from the root to a leaf (i.e. the worst-case number of queries used on any input). 

We will consider the complexity of computing relations in the decision tree model. Let / C 
X" X Y. We write dom/ = {x : 3y,{x,y) G /} and range/ = {y : 3x,{x,y) G /}, and also 
f{x) = {y : {x,y) 6 /}, f~^{y) = {x : {x,y) G /}. / is said to be a partial function if |/(a;)| = 1 
for all x G dom/; in this case, we abuse notation and write /(x) for the single element within the 
set f{x). / is a total function if in addition dom / = X". We say that / is constant if there exists 
y & Y such that f~^{y) = dom/, and trivial if, for all y e Y, f~^{y) = dom/. Note that trivial 
relations are constant, and that the empty relation / = is trivial. 

We say that a decision tree T computes a relation / if, for all x G dom/, on input x, T outputs 
y such that {x, y) G /, i.e. fxix) G /(x) for all x G dom/. If this holds, we say that fx is consistent 
with /. The decision tree complexity D{f ) is the minimal depth over all decision trees computing 
/. This quantity is also known as the exact (classical) query complexity of /. 

This can be generalised to define a weighted variant of decision tree complexity -D (/, [wi , . . . , Wn] ) , 
which is equal to the minimal worst-case number of queries required to compute /, given that query- 
ing the i'th element of the input costs Wi qiieries. This can again be expressed as the minimal depth 
of a decision tree computing / with weights wi, . . . ,Wn on the inputs, where the depth of a de- 
cision tree T with weights on the inputs is simply the maximal sum of the weights of the inputs 
specified by the labels of vertices of T, on any path from the root to a leaf. The usual decision tree 
complexity is recovered by setting D(f) = D{f, [1, . . . , 1]). 

For / C X"- X y, let fi^^h C X" x F be the modification of / we obtain by substituting b as the 
z'th input to /. That is, 

/if-6 = {{x,y) e f ■.Xi = b}. 

We can use this definition to write down the following characterisation of weighted decision tree 
complexity (indeed, this could even be seen as its definition). 

Fact 1. Let f C X" x Y, and wi, ... ,Wn G N. Then D{f, [wi, . . . , Wn]) = if and only if f is 
constant. If f is not constant, 

D{f,[wi,..., Wn]) = miiimaxZ)(/j^6, [wi, . . . ,Wn\) + Wi. 
ie[n] bex 

Proof. If / is constant, the tree can simply output y such that f^^{y) = dom / using no queries. 
On the other hand, if there exists a tree that computes / and makes no queries, then there is only 
one fixed value y that is output by the tree; so for all x G dom/, (x, y) G /; so f~^{y) = dom/. 
If / is not constant, then for any z, we can produce a tree computing / with weights wi, . . . ,Wn 
by first querying the i'th variable, obtaining outcome then using an optimal tree for fi^^i) with 
weights ijoi, . . . ,Wn- This proves the "<" part. For the ">" part, given a tree computing / which 
makes its first query to the i'th variable, where i achieves the minimum above, we can obtain a 
tree for computing /i<_fe, for any 6 G X, by simulating this tree, replacing the first query with the 
constant b and hence saving weight Wi. □ 

In the above fact, and the rest of the paper, we write [n] for {!,...,«}. 
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1.3 Composition of relations 

For i e [n], let /* C X'^^ x Y, and g CV^ x Z. Set m = J2i "^i- The composition g o{f^,..., /"-) 
is defined as 

{{x,z)eX^xZ:^ye F", {x\ y^) G f A (y, z) € 5}, 

where we write x = (x^, . . . ^x^), with G X"**. If / and g are total functions, this simply says 
that for all x&X"^, 

h{x)=g{f{x'),...,nxn). 

In words, an algorithm to compute the relation h on input x has to output an arbitrary z such that 
z G g{y), for some y such that for all i, yi G P{x^)- 

Assume that /* is non-trivial for all i. If some of the relations P are constant (i.e. {P)~^{bi) = 
dom/*, for some 6j G Y), this expression for h can be simplified by removing these inputs and 
modifying g appropriately. Write g for the relation obtained by fixing constants to the inputs to g, 
where possible. In other words, if iS C [n] is the set of indices i such that P is not constant, then 

9 = {{y,z) e g ■.yi = bi,i ^ S}, 

h = go {f\ ...,n = {ix,z) eX^xZ:3ye Y^^i G S, (x^y^) G f A {y,z) G 5}- 

2 A composition theorem 

We are finally ready to state and prove our main result. 

Theorem 2. Fix n G N and finite sets X, Z. Let g C {0, 1}" x Z, and for 1 < i < n, let 
p C X'^i X {0, 1} be non-trivial. Set m = J2i^i- Define h C X'^ x Z by 

h = go{f\...,r). 

Also let g be the relation given by substituting constant inputs into g. That is, for each i, if 
{P)~^{bi) = dom/*, then g is replaced with gi^bi (o-s /* is non-trivial, there is exactly one such 
bi). Then 

D{h) = D{g,[D{f^),...,D{f-)]). 

Proof. To see that D{h) < D{g, [D{f^), . . . , £>(/")]), observe that the algorithm discussed in the 
introduction achieves this complexity. Formally, we compute h as follows: first form a new relation 
g by replacing constant relations /* with corresponding constants as inputs to g; then use an optimal 
tree for computing g with weights D{f^), . . . , D{f""'), replacing each query to the z'th variable with 
the use of an optimal tree for The more interesting part is the corresponding lower bound, the 
proof of which proceeds by induction on D{h). 

For the base case, take D{h) = 0. Then h is constant. By definition, this implies that there 

exists z such that the following holds: For all x = {x^ , . . . ,x^) such that there exist y' G {0, 1}" 
and z' £ Z with (x*, y[) G /* for all i and (y' , z') G g, there exists y G {0, l}" such that {x\yi) G /* 
for all i and (y, z) G g. Let 6" = {i : /* is not constant}. For all i & S, let a*, 6* G X™» be picked 
such that (a*, 0) G /, (a*, 1) ^ /, {¥, 0) ^ / and (6*, 1) G /. As /* is not constant, such elements a*, 
6* exist. Further, for all i ^ S, let q satisfy (/*)~^(ci) = dom/*. As /* is not trivial, q is unique 
and there exists G X^^ such that (d*, Cj) G but (d*, c'-) ^ /* for c'- 7^ Cj. 

For s : S" — 7- {0, 1}, let the string y{s) G {0. 1}" be defined as follows: y{s)i = s{i) for i G S, and 
y{s)i = Ci for i ^ S. For each distinct input x G {0, 1}™ whose i'th block is equal to a' or 6' (for 



4 



i G S) or (for i ^ S), a different string y(s) is obtained, and if {x, y) G (f^, . . . , /"), y = y{s). In 
particular, by choosing s appropriately one can produce any y G domg. As there is exactly one y 
such that {x,y) £ (/^, . . . ,/"), and h is constant, {y,z) £ g. Thus, for all y £ domg, {y,z) £ g. 
Hence g is constant and D{g, [D{f^), . . . , D{f^)]) = as required. 

For the inductive step, assume that D{h) = D{g,[D{f^), . . . , D{f^)]) for all relations h = 
g o (/^, . . . , /"■) such that /* is non-trivial for all i, and such that D{h) < k, for some k. Now let 
h = g o (/^, • • . , f"') satisfy D(h) = k + 1. As /i is not constant, by Fact 1 

D(h) = min maxL)(/ij^{,) + 1 

ie[m] beX 

= mill niin m&xD{g o (/\ . . . , /V^, . . . , /")) + 1. 

ie[n] je[mi] beX 

As /* is non-trivial for all i, for any pair i,j there exists b' such that /j^^/ is non-trivial. Thus, 
for b achieving the above maximum, is non-trivial. Let i,j be the indices achieving the above 
minimum. Then, by the inductive hypothesis, 

maxZ)(<7 o {f\ . . . , . . . , /" )) = maxi?0, [D{f), . . . , Z)(/j^,), . . . , !?(/")]), 
bex bex ■' 

where g is the relation obtained by substituting 6^ as the fe'th input to 5, for all A; 7^ i such that 
{f'^)~^{bk) = dom/'^, and also substituting bi as the i'th input to g if {fj^f,)~^{bi) = dom/j^^. 
We thus have 

D{h) = ma^DCg,[D{n,---,D{fj^b),---,D{n]) + l 

bex ■' 

= Did, [Dif), . . . , maxZ)(/j^,), . . . , !)(/")]) + 1, 

b&X ■' 

where the second equality holds because decreasing any of the weights cannot increase the decision 
tree complexity. First assume that, for b achieving the maximum, /j^^ constant. Then g = g 

and 

D{h) = D{-g,[D{f),...,ma^D{fj^,),...,D{n]) + l 

> D{-g,[Dif'),...,D{n-l,...,Din]) + l 

> Di-g,[D{f ),..., Din]) 

as required. The first inequality follows from D{p) < max^gx D{fj^^) + 1, while the second holds 
because, to find a decision tree T which computes g with weight D[p) on the z'th variable, we 
can just use an optimal decision tree T' for g with weight D{f^) — 1 on the i'th variable. As T' is 
optimal, we only encounter the i'th variable at most once on any path from the root to a leaf, so 
the depth of T is upper bounded by the depth of T' , plus 1. 

On the other hand, now assume that, for h achieving the maximum, /j^^ is constant. Then 
it must hold that D{f^) = 1 (if /* were already constant, querying any of the variables on which 
it depends would not achieve the overall minimum; if /* is not constant, and there exists b' £ X 
such that /j^fo/ is not constant, such a b' would achieve the maximum over b; hence D(f^) = 
1). So we can produce a decision tree for g with weights D{f^), . . . , D{f^) by computing /* 
using one query to the j'th variable and then using an optimal decision tree for g with weights 
D{f), . . . , D{fn, 0, D{f+^), Din. Thus 

Dig, [Dif), Din) < ^^^Di=g, W), • • • , ^(/j^b), • • • , Din) + 1 = Dih) 

b&X ■' 

as required, completing the proof. □ 
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2.1 Corollaries 

We finish by noting a few simple corollaries of Theorem 2 that can be useful in amplifying sepa- 
rations between decision tree complexity and other complexity measures. We begin by considering 
the case where all the relations /* are the same. 

Corollary 3. Fix / C X^' x {0, 1} and g C {0, 1}" x Z. Define h C X'^'^ x Z as h = go{fJ, . . . J). 
ThenD{h)=D{f)D{g). 

Proof. If / is constant, then D{h) = 0. Otherwise, take f^ = --- = f"' = fin Theorem 2. Then 
D{h) = D{g, [D{f), D{f)]) = D(f)D{g). □ 

Second, we observe that the composition theorem can be used to characterise the decision tree 
complexity of an iterated boolean function, via the following corollary. 

Corollary 4. For any f : {0, 1}" {0, 1}, let the k-fold iteration off, : {0, 1}"'' {0, 1}, be 
defined as follows. Set f^^^ = f, and for k > 2, split the input x G {0, 1}" into blocks x^, . . . ,x" 
o/n'^-i bits each, and set f'^^\x) = f{f^^-^\x^), . . . , f^^-^^x"^)). Then D{f^''^) = D{f)^ . 

Proof If / is constant, then D{f^^^) = = D{f)''. Otherwise, by Theorem 2, for k>2, D{f^''^) = 
D{f, [i:>(/('=-i)), . . . , = D{f)D{f^^-^^). The claim follows by induction. □ 

Nisan and Szegedy used an iterated version of the "not all equal" function on 3 bits to prove an 
asymptotic separation between decision tree complexity and polynomial degree [6], based on lower 
bounding decision tree complexity by sensitivity. Corollary 4 gives a direct proof of this result. 

Finally, we obtain the following special case of a direct sum result of Jain, Klauck and Santha [5]. 

Corollary 5. For any non- trivial /I, ...,/" C X^' X {0, 1}, let /i C X^" x {0, 1}" be defined by 
h={f\f\..., r). Then D{h) = E.^h ^(/O- 

Proof. Take5(yi,...,y„) = (yi,...,yn) in Theorem 2. Then Z)(/i) = Z)(5, [L>(/i), /)(/")]). Let 
5 = {i : f is not constant}. Then D{g, [D{f^), . . . , D{f^)]) = EieS ^if) = E^e[n] D{f'). □ 

The proof technique of [5] allows the range of each relation /* to be arbitrary, rather than being 
restricted to {0, 1} as here. 

Acknowledgements 

I would like to thank Graeme Mitchison and Toby Cubitt for helpful discussions and comments on 
this work, and an anonymous referee for inspiring it. 

References 

[1] H. Buhrman and R. de Wolf. Complexity measures and decision tree complexity: a survey. 
Theoretical Computer Science, 288:21-43, 2002. 

[2] A. Drucker. The Complexity of Joint Computation. PhD thesis, MIT, 2012. 



6 



[3] P. H0yer, T. Lee, and R. Spalek. Tight adversary bounds for composite functions, 2005. 
quant-ph/0509067. 

[4] P. H0yer, T. Lee, and R. Spalek. Negative weights make adversaries stronger. In Proc. 39^^ 
Annual ACM Symp. Theory of Computing, pages 526-535, 2007. quaiit-ph/0611054. 

[5] R. Jain, H. Klauck, and M. Santha. Optimal direct sum results for deterministic and randomized 
decision tree complexity. Inf. Proc. Lett, 110(20):893-897, 2010. arXiv: 1004.0105. 

[6] N. Nisan and M. Szegedy. On the degree of Boolean functions as real polynomials. Computa- 
tional Complexity, 4(4) :30 1-313, 1994. 

[7] B. Reichardt. Span programs and qiiantum query complexity: The general adversary bound is 
nearly tight for every boolean function. In Proc. 50^^ Annual Symp. Foundations of Computer 
Science, pages 544-551, 2009. arXiv: 0904. 2759. 

[8] B. Reichardt. Reflections for quantum query algorithms. In Proc. 22^'^ ACM-SIAM Symp. 
Discrete Algorithms, pages 560-569, 2011. arXiv: 1005 . 1601. 



7 



